The statistical distribution of nucleic acid similarities.
Abstract
All pairs of a large set of known vertebrate DNA sequences were searched by computer for their most similar segments. Analysis of these data shows that the computed similarity scores are distributed proportionally to the logarithm of the product of the lengths of the sequences involved. This distribution is closely related to recent results of Erdős and others on the longest run of heads in coin tossing. A simple rule is derived for determining the statistical significance of the similarity scores and for helping relate statistical and biological significance.
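The coin-tossing result mentioned in the abstract can be illustrated with a short simulation. The sketch below is not the paper's procedure. It only checks, assuming independent tosses with head probability p, that the longest run of heads in n tosses grows roughly like the logarithm of n to base 1/p. The function names (`longest_head_run`, `mean_longest_run`, `log_prediction`), the trial count, and the seed are illustrative choices, not taken from the source.

```python
import math
import random


def longest_head_run(tosses):
    """Return the length of the longest consecutive run of truthy values."""
    best = current = 0
    for is_head in tosses:
        # Extend the current run on a head, reset it on a tail.
        current = current + 1 if is_head else 0
        best = max(best, current)
    return best


def mean_longest_run(n, trials=200, p=0.5, seed=0):
    """Average longest head run over `trials` simulated sequences of n tosses."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    total = 0
    for _ in range(trials):
        total += longest_head_run(rng.random() < p for _ in range(n))
    return total / trials


def log_prediction(n, p=0.5):
    """Leading-order growth of the longest run: log of n to base 1/p."""
    return math.log(n) / math.log(1 / p)


if __name__ == "__main__":
    # Compare the simulated mean with the logarithmic growth rate.
    for n in (100, 1000, 10000):
        print(n, round(mean_longest_run(n), 2), round(log_prediction(n), 2))
```

Multiplying n by a constant should shift both the simulated mean and `log_prediction(n)` by about the same amount. In the abstract, the analogous quantity for two sequences is the logarithm of the product of their lengths.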
Similar resources
On the statistical significance of nucleic acid similarities
When evaluating sequence similarities among nucleic acids by the usual methods, statistical significance is often found when the biological significance of the similarity is dubious. We demonstrate that the known statistical properties of nucleic acid sequences strongly affect the statistical distribution of similarity values when calculated by standard procedures. We propose a series of models...
Cellular Morphology and Immunologic Properties of Escherichia coli Treated With Antimicrobial Antisense Peptide Nucleic Acid
Background & Objectives: Antisense peptide nucleic acids (PNA) that target growth-essential genes show potent bactericidal properties without cell lysis. We considered whether PNA treatment influences the total nucleic acid content of the bacteria and applied this approach to develop a new delivery system to dendritic cells (DCs). DCs are the most potent antigen-presenting cells in th...
Accurate statistical model of comparison between multiple sequence alignments
Comparison of multiple protein sequence alignments (MSA) reveals unexpected evolutionary relations between protein families and leads to exciting predictions of spatial structure and function. The power of MSA comparison critically depends on the quality of the statistical model used to rank the similarities found in a database search, so that biologically relevant relationships are discriminated f...
Synthesis of Nitrogen Functional Derivatives of 5-substituted-6-azauracil as one of the Four Nucleobases in the Nucleic Acid of RNA
3-Arylhydrazono-2,4-dioxo-4-phenylbutanoates have been prepared by the coupling of benzoylpyruvate with aryldiazonium chlorides. Reactions of the 3-arylhydrazono-2,4-dioxo-4-phenylbutanoates with 1-aminoguanidine, semicarbazide, and thiosemicarbazide gave 5-substituted 2-imino-6-azauracil (3a), 6-azauracil (3b), and 2-thio-6-azauracil (3c), respectively. The analytical data of these compounds - IR, 1H...
Journal title:
- Nucleic acids research
Volume 13, Issue 2
Pages: -
Publication date: 1985